Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture models

نویسندگان

Konstantin Markov

Seiichi Nakagawa

چکیده

In this paper we propose a new speaker identi cation system, where the likelihood normalization technique, widely used for speaker veri cation, is introduced. In the new system, which is based on Gaussian Mixture Models, every frame of the test utterance is inputed to all the reference models in parallel. In this procedure, for each frame, likelihoods from all the models are available, hence they can be normalized at every frame. A special kind of likelihood normalization, called Weighting Models Rank, is also proposed. Experiments were performed using two databases TIMIT and NTT. Evaluation results clearly show that frame level likelihood normalization technique is superior to the standard accumulated likelihood approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Text-independent speaker recognition using non-linear frame likelihood transformation

When the reference speakers are represented by Gaussian mixture model (GMM), the conventional approach is to accumulate the frame likelihoods over the whole test utterance and compare the results as in speaker identi®cation or apply a threshold as in speaker veri®cation. In this paper we describe a method, where frame likelihoods are transformed into new scores according to some non-linear func...

متن کامل

Speaker verification using frame and utterance level likelihood normalization

In this paper, we propose a new method, where the likelihood normalization technique is applied at both the frame and utterance levels. In this method based on Gaussian Mixture Models (GMM), every frame of the test utterance is inputed to the claimed and all background speaker models in parallel. In this procedure, for each frame, likelihoods from all the background models are available, hence ...

متن کامل

Text-Independent Speaker Recognition Using Gaussian Mixture Models Final Term Paper Proposal

The proposed project is an implementation of speaker recognition systems, both identification and verification. The systems are built using Gaussian Mixture Models, as proposed in several papers from Douglas A. Reynolds. The use of Fractional Covariance Matrix is studied as an possible increase for the traditional recognition systems. keywords: speaker recognition; Gaussian Mixture Models; like...

متن کامل

International Journal of Emerging trends in Engineering and Development ISSN 2249-6149 Available online on http://www.rspublication.com/ijeted/ijeted_index.htm Issue 2, Vol.5 (July 2012)

In This paper presents an overview of a state-of-the-art text-independent speaker verification system. First, an introduction proposes a modular scheme of the training and test phases of a speaker verification system. Then, the most commonly speech parameterization used in speaker verification, namely, cepstral analysis, is detailed. Gaussian mixture modeling, which is the speaker modeling tech...

متن کامل

Discriminative training of GMM using a modified EM algorithm for speaker recognition

In this paper, we present a new discriminative training method for Gaussian Mixture Models (GMM) and its application for the text-independent speaker recognition. The objective of this method is to maximize the frame level normalized likelihoods of the training data. That is why we call it the Maximum Normalized Likelihood Estimation (MNLE). In contrast to other discriminative algorithms, the o...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1996

Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture models

نویسندگان

چکیده

منابع مشابه

Text-independent speaker recognition using non-linear frame likelihood transformation

Speaker verification using frame and utterance level likelihood normalization

Text-Independent Speaker Recognition Using Gaussian Mixture Models Final Term Paper Proposal

International Journal of Emerging trends in Engineering and Development ISSN 2249-6149 Available online on http://www.rspublication.com/ijeted/ijeted_index.htm Issue 2, Vol.5 (July 2012)

Discriminative training of GMM using a modified EM algorithm for speaker recognition

عنوان ژورنال:

اشتراک گذاری